On the use of high-level information in speaker and language recognition
نویسندگان
چکیده
Automatic Speaker Recognition systems have been largely dominated by acoustic-spectral based systems, relying in proper modelling of the short-term vocal tract of speakers. However, there is scientific and intuitive evidence that speaker specific information is embedded in the speech signal in multiple shortand long-term characteristics. In this work, a multilevel speaker recognition system combining acoustic, phonotactic and prosodic subsystems is presented and assessed using NIST 2005 Speaker Recognition Evaluation data. For language recognition systems, the NIST 2005 Language Recognition Evaluation was selected to measure performance of a high-level language recognition systems.
منابع مشابه
شبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملPragmalinguistic and Sociopragmatic Recognition of High and Low Level EFL Learners
This study investigated the effects of English as foreign language (EFL) proficiency on what the authors of this study called pragmalinguistic and sociopragmatic recognition of EFL learners. To elicit the data, the study used two types of pragmatic measures: a pragmalinguistic recognition (PLR) test and a sociopragmatic recognition (SPR) test. Both tests were developed by the researchers of thi...
متن کاملPragmalinguistic and Sociopragmatic Recognition of High and Low Level EFL Learners
This study investigated the effects of English as foreign language (EFL) proficiency on what the authors of this study called pragmalinguistic and sociopragmatic recognition of EFL learners. To elicit the data, the study used two types of pragmatic measures: a pragmalinguistic recognition (PLR) test and a sociopragmatic recognition (SPR) test. Both tests were developed by the researchers of thi...
متن کاملSpeaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملSpeaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006